Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Remove assert for alibi in case of FusedSDPA. #587

Open
wants to merge 1 commit into
base: habana_main
Choose a base branch
from

Conversation

itaraban
Copy link

@itaraban itaraban commented Dec 4, 2024

No description provided.

@itaraban itaraban force-pushed the fix_fused_main branch 3 times, most recently from b940234 to faa0f6a Compare December 4, 2024 09:56
@itaraban itaraban force-pushed the fix_fused_main branch 2 times, most recently from bf62a27 to d732bbe Compare December 4, 2024 10:12
@itaraban itaraban marked this pull request as ready for review December 5, 2024 23:44
@@ -212,8 +209,7 @@ def _compile_region(self, model, name, module):

def _set_attn_bias(self, attn_metadata, batch_size, seq_len, device,
dtype):
if (attn_metadata is None or self.prefill_use_fusedsdpa
or not attn_metadata.is_prompt):
if (attn_metadata is None or not attn_metadata.is_prompt):

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why did you remove prefill_use_fusedsdpa here?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants